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A STRUCTURE THEOREM FOR SETS OF SMALL POPULAR 
DOUBLING, REVISITED 


PRZEMYSLAW MAZUR 


Abstract. We prove that every set A C Z/pZ with E^; min(lyi * 1a(x), t) ^ 
(2 + 5)tExlA{'x) is very close to an arithmetic progression. Here p stands for 
a large prime and (5, t are small real numbers. This shows that the Vosper 
theorem is stable in the case of a single set. 


1. Introduction 

In the recent paper |MazI5] we proved a structure theorem for sets of integers 
having small popular doubling. We were aiming to extend this theorem to make 
it also work for sets of residue classes modulo a prime. Unfortunately we were 
unable to achieve this using the methods of that paper. In this paper we prove that 
missing statement using entirely different methods. To be more specific, our goal 
is to prove the following statement. 

Theorem 1.1. Let 0 < ai < a 2 < \ and rj > 0. Then there exist positive 
real numbers So = So(ai,a 2 ,r/), C = (7(01,02,??) and po = po(cii, Cf2, ??) with the 
following properties. Let p > po be a prime and let A C Z/pZ be a set. Suppose 
that the density a = ^ satisfies oi < o < 02 . Furthermore, suppose that 

Ea; min(lai * lA(a?), t) ^ {2 + 5)at 

for some numbers S € (0,i5o) and t € (O,to(cu,ci2,?7,<5)). Then there is an arithme- 
tuc progression P with |P| <(! + (! + r])d)ap and |A \ P| < (7(i5o)^/^p. 

To fix the notation, let us use the Haar probability measure on all groups appear¬ 
ing in this paper. That means that the symbol E^, used above is just a shorthand 
for ^ arid by / * g{x) we mean ¥,yf(jj)g{x - y). 

The dependences in the statement of Theorem 11.11 look rather complicated, let 
us justify them a little bit. If we were dealing with sets satisfying just |A -b A| ^ 
{2 + S)ap, then the correct bound for the size of P would be |P| = |^+^| —1^| + 1 = 
(1-1- 6)ap + 1 (see |SZ09| for details). The parameter rj indicates that we can make 
as small error as we like, even in terms of S, but to achieve that the popularity 
parameter t has to be sufficiently small, in terms of both ?? and S. Ideally we would 
like to conclude that |P| ^ ((1 + S)a + 0{t))p (see IMazlSp . but with our methods 
we are unable to achieve that. 

2. Proof of the main result 

Let us start with fixing all the parameters and the set A C Z/pZ satisfying the 
assumption. We intend to apply the arithmetic regularity lemma (Theorem IA.9I) 
to the function / = 1^. Let e > 0 and P be a growth function to be specified later. 
Then we can write / = fstr + /smi + /unf, as in the statement of Theorem lA.91 Let 
us first get rid of the function /unf- 

Lemma 2.1. Let g,h : lip’ll -A C be functions. Then the following inequality 
holds: 

\\g * h \\2 < ||p||j/ 2 ||/i ||[/2 


1 
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Proof. Using Parseval’s identity and the relation between the convolution and 
Fourier transform we get 

ii5 * h\\i = XI = X 

r r 

On the other hand, we know that 

= X and ||/i|1^2 = X 

r r 


The inequality is then equivalent to the Cauchy-Schwarz inequality in the following 
form: 







□ 


Corollary 2.2. Let 1 a = /str +/smi +/unf as above. Then the following inequality 
holds: 

2 

Ea; min((/str + /sml) * (/str + /sml)) i) ^ (2 + S)at + ' 

Proof. First of all, since for all characters y we have |/unt(x)| = |(/unf,x)| ^ j^{M) ’ 
we can estimate the U"^ norm of f^ni as 


||/unf||^.=Xli-f(x)l' 




1 


(J-(M))- 


'X = 


ll/unfll 


i . Il/untll 


1 


Therefore for any function g : TjIpL —>■ C with ||g||oo ^ 1 the lemma above gives 

1 


ll/unt * sill < ll/unf * 5II2 < ll/unfllu^llsllu^ < ll/unfUu^l 




Applying this to the functions g = 1 a and g = /str + /smi and using triangle 
inequality we get 

2 

II 1a * 1a ~ (/str + /sml) * (/str + /sml)||l ^ ' 

Now we use an easy-to-check inequality |min(a,t) — min(6,t)| ^ |a — 5| for a = 
1.4 * lA(ai) and b = (/str + /smi) * (/str + fsm\){x) for any x G Z/pZ. Combining 
them with another instance of triangle inequality yields 

2 

|Ea, min(1.4 * 1.4(x), t) E^, min((/str T /smi) * (/str T fsm\)^i)\ ^ 


(J-(M))i/2’ 


which gives the result. 


□ 


We managed to remove /unf from our considerations, now it is time for /smi- To 
deal with this, let A be a small quantity to be specified and let B = {x G TjIjiL : 
ll‘i^(2^)ll ^ a Bohr set. Recall that ip is the homomorphism used to construct 

/str, for the definition of ||(p(a;)||, see the appendix. Now let 

C = {a; G Z/pZ :/str(ai) > A and Ey(zB\fs-mi{x + y)\‘^ ^ s}. 

Intuitively, we take all elements where /srt is somewhat large and where /smi is 
too small to destroy that. First of all, let us estimate the size of C. The set 
C' = {x G JjlyL : fstr{x) ^ A} has size at least fstr{x) — Ap, since 

'^fstr{x) = X fstr{x) + X Ur{x) ^ |C"| + Ap. 
xGC' x^C 


X 
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On the other hand, the set C" = {x € Z/pZ : EygB|/snii(x + y)\^ > e} has size at 
most ep, because 

e" > + y)\^) ^ 

P 

Therefore the size of C can be estimated as \C\ = |C" \ C"'| > |C"| — |C"'| ^ 
fstrix) — (A + e)p. To make it more explicit, note that from the construction of 
/unf we see that Ex/unf(a;) = 0. That leads to Ex(/str + fsm\){x) = ExlA(a;) = a; 
combining it with |Ex/smi(a;)| ^ ||/smi||i < ll/smilU ^ e we get Ex/str(a;) > a - e. 
In the end it means that \C\ ^ (a — 2e — \)p. 

Now it is time to see the reason why we defined the set C in this way. To see 
this, let xi,X 2 € C and consider four functions: fi, f 2 , gi, 92 : B ^ C defined as: 

fi(x) = fstrixi + (-l)*x), 5 i(x) = fsmlixi + (-l)*x). 

Since fstr + /smi is a nonnegative function, we have the inequality 

(/str T /sinl) * (/str T /sml)(:^l “t” X 2 ) ^ 

^ /*sml)(^l ^)(/*str T fsm\)ix2 x)li 3 (x) — 

= -—^(/i + 5 ii /2 + 92 )- 

p 

Now from the Lipschitz nature of F and the definitions of B and C we know 
that /i(x),/ 2 (x) ^ -I for all x £ B, which leads to (/i,/ 2 ) ^ x- Moreover 
since ||/i ||2 ^ ll/i||oo ^ 1 and ||gi ||2 ^ v/e (by the definition of C), we also have 
I(/i 152 )I, |(/ 2 ,ffi)| ^ \/e and |(( 7 i,g 2 )| ^ £■ Combining all the inequalities together 
we get (/i +gi, /2 + 92 ) ^ X ~ 2-ye — e. Also, from the properties of Bohr sets (see 
for example |TVn6p we know that ^ ^ ■ Therefore if only 

* ^ ( 2 l 7 )^(x“ 2 \/e-e), we have just proved that (/str + /smi)* (/str + /smi)(a:) ^ t 
for all X G C + C. Since 

2 

Ex min((/str + /sml) * (/str + /sml)) 0 ^ (2 + S)o't + jl/2 ’ 

we know that in this case we have [C + (7| < ((2 + 6)a + )p- 

The main term of the above expression is 2ap, while the main term of the 
expression bounding the size of C is ap. Therefore if the error terms are sufficiently 
small, we can make use of Serra-Zemor Theorem (proven in [SZOfl] ! and conclude 
that the set C is contained in an arithmetic progression P C Z/pZ of size |P| = 
IC + Cl — |C| + 1. We can assume without loss of generality that |P| ^ |A| as we 
can extend P if necessary. We will come back later to the conditions that must be 
satisfied, let us now proceed with the proof. 

We will examine how the progression P is related to the set A. First of all, since 
C C P, we know that there can only be ep elemets x outside P for which /str ^ A. 
Therefore we have the inequality 

Ex max((/str - lp)(a;), 0) ^ e + A. 

This means that we also have Ex max((lp — /str)(x), 0) < Ex (Ip — /str) (a:) + e + A. 
Adding those two inequalities we get ||lp — /str||i ^ Ex (Ip — fstr){x) + 2(6: + A). 
The last quantity is then an upper bound for the absolute value of the difference of 
corresponding Fourier coefficients of Ip and /str- In other words, |(lp — /strjX)l ^ 
Ex(lp — fstr){x) + 2(e + A) for each character y. On the other hand, we know that 

I(1a ~ /stD x)l ^ |(/smL x)l T |(/unf) x) I ^ + F(^M) 
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for each character By triangle inequality it means that for every character % the 
following holds: 


|(lp ~ 1A) x)l ^ ]Ea;(lp ~ /str)(a:) + 3^ + 2A + 


1 


J-(M) 




^ E 2 ;(lp — l^)(a:) + 4s + 2A + 


J-(M)- 


Recall now that P is an arithmetic progression, so one of its (non-trivial) Fourier 
coefficients is as large as it could possibly be, more precisely there exists xi with 


|ip(xi)l = l(lp,Xi)l = 


“(f) 


psm 


Now let z be a unit complex number satisfying zlp(xi) = |lp(x)|- Since Ip —1 a = 
2 • Ip — Iahp ~ IauP) we have the following lower bound: 

Kip - 1 a,x)I ^ • (2 • Ip - Iahp - 1aup,x)) ^ 




2 sin - sin ( d^uPJ-i). ^ 


Psin(f) 

We can rearrange the numerator of the last expression as follows 

I Mzihl _ ( 1 -^ u 


P 

= 4 sin 


P 


2p 


Sin 




PI • TT 


V 2p 


sin 


(|Al| + |P|-2)7r 


+ 2 sm ( --- I cos 

V 2p 


2p 

(|A| + |P|-2K 

2p 


Now we are almost ready to estimate the size \A \ P\. First of all, since |P| ^ |zl|, 
the last summand is positive and can be discarded, leaving us with the inequality 

f i\A\ + \P\-2U ^ 


Ea:(lp~ 1 a)(2:)+4£+2A + 


1 

P{M) 


> 


4sin(^^) sin(m^ 


sin sin 


V 


2p 


psm 


(f) 


If only 4e + 2A + ^ (1 ~ r])Sa and |P| < (1 + (1 + ri)S)ap, we have that the 

left hand side is bounded by 26a. On the other hand, if we had |^\P| > C((5a)i/^p, 
then the same would hold for |P \ p 1|. Knowing the behaviour of sine around 0, 
we would argue that the first two factors in the numerator are at least 
for some other constant C. But the last factor is bounded away from 0 (as |^| 
and |P| are bounded away from both 0 and |) and the denominator is around tt 
( wd.o.g. > 3), so this would contradict our inequality. In the end we need to have 
|y4\PK C{a5Y/^p. 

The Theorem is now proven up to checking that we can choose all the constants 
to make the calculations work. First of all, we would like to use the Serra-Zemor 
Theorem for the set C. We had jC + C| ^ ((2 + S)a + t(p(M))i /2 )p and jCI ^ 
(a — A — e)p. To make sure that \C + C\ < (2 + 10“‘*)|C'| we want to require 
A,£ < 10 “®q;, 5 < 10“S and t(P(M))i/^ > 10®a“i (say). Then, we would like to 
have jC + Cj — |C| + 1 = |P| <(! + (! + p)5)ap. This rearranges to 
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For this it would be enough if A, £, ^ Moreover, we need t < 

(w)^(x “2\/£-£). This suggests setting e = ^ and requiring t 

We have just listed all the requirements and now the strategy is as follows. Set 

A = (we can freely assume rj < 10“® to make sure that A < 10“®a) and £ = 

Now the only thing is to make sure that might 

seem impossible, as the upper bound on M depends of J- and we might not be 
able to fit into the correct range. The solution to this problem is the following: 
suppose that the above inequalities hold for some other number t'. Then the entire 
argument is correct assuming that the initial inequality describing popular doubling 
of A holds with parameter t' instead of t. A similar argument to [MazlSl Corollary 
3.5] shows that this is indeed the case for any t' ^ t. This suggest the following 
strategy: 

• given ai, 02 , r], choose Jq > 0 so that (1 + (1 + r])So)a 2 < ^ (to make sure 
all the hyptheses of Serra-Zemor Theorem are satisfied), 

• given S G (0, Jo), set A = and e = 

• define J^(M) = and apply the arithmetic regularity lemma 

to get an upper bound M ^ Mg, 

• set to = 

Then we can hnd t' ^ to with the postulated properties. Since t ^ to-, we also have 
t ^ t', as required. That ends the proof of Theorem ll.il 

Appendix A. Arithmetic regularity lemma 

In the appendix we give a self-contained proof of the arithmetic regularity lemma 
for t/^ norm. The lemma was proven in full generality (i.e. for norm for arbitrary 
k) by Green and Tao in [GTlOj . There is also an exposition by Eberhard of the 
t/^ case. Unfortunately both of them have a feature that is a disadvantage for 
us, namely they deal with functions defined on {1,..., N} rather than Z/pZ. As a 
result the “structured part” obtained there comes from a Lipschitz function defined 
on [0,1] X Z/gZ X T'^ (in the case). However, as we work over a cyclic group 
of prime order, in our setting everything is periodic (mod p) and there is no room 
either for non-periodic behaviour (such as [0,1]), or periodic behaviour modulo 
other numbers. Therefore we are aiming for a slightly different statement of the 
regularity lemma, but the methods of proof remain the same. 

Let us by an pre-character on a group G mean a homomorphism ip : G —>■ T and 
by a character a homomorphism x '■ G ^ {z G C : |z| = l}. Of course there is 
one-to-one correspondence between those, given by the equation x = This 

terminology is by no means standard and is used only for the purpose of this paper. 

Before we start, let us fix some notation. For any set F of pre-characters on 
Z/pZ and any positive integer n, we define a partition B = B(T,n) of Z/pZ into 
cells. Intuitively, B corresponds to the partition of the torus into cubes of 
side length More formally, two points x,y G "LIp^L belong to the same cell if 
p{x),ip{y) G [^, ^'^^^ ) C T for some G Z for all p G F. Note that if F C F' and 
n\n', then B{T', n') is a refinement of H(r, n), i.e. each cell of the former is a union 
of cells of the latter. 

For any function / : Z/pZ —> C and any partition B = B{T,n) define the 
conditional expectation E(/|H) in the standard way, i.e. E(/|i3)(a;) is the average 
of / on the cell of B containing x. In other words, E(/|H) is just the orthogonal 
projection of / onto the space of all H-measurable functions (constant on every cell 
of B). Note that if B' is a refinement of B then E(E(/|H')|H) = E(/|H) and more 
generally E(/ • Eig\B)\B') = E{f\B')Eig\B). 
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Lemma A.l. Let n > 0 and let ip : Z/pZ T he an pre-character and let 
^ T be a set of characters containing ip and let B = B{T,n). Suppose 

that f : 'Ljplj <C is a function with ||/||oo ^ 1 - Then 

O-TT 

|(/-E(/|S),x)|< — 

n 

Proof. The key idea is that x is almost constant on each cell of B. More precisely, 
by the properties of orthogonal projections we have 

(/-E(/|S),x) = (/,x-E(x|S)). 

But since ip GT, the function y —E(x|,B) is bounded pointwise by |1 —^ ^ 
and the claim follows. □ 

Corollary A.2. Let d > 0 and let / : Z/pZ —>■ C with ||/||oo ^ 1- Then there 
exists a set T of pre-characters of size |r| ^ ^ and n ^ ^ such that for B = B{T, n) 
and any character x we have 

(1) |(/-E(/|S),x)K^. 

Proof. Define n = and build the set T iteratively, at the beginning T = 0. At 
each stage we ask if the inequality o is satisfied for every character. If so, we finish 
our procedure, otherwise we take a character x for which the inequality fails and add 
the corresponding pre-character ip to T. Let B = B{r,n) and B' = BiV U {ip},n). 
By the previous lemma we know that 

|(/-E(/|S'),x)K-^^. 

n 2 

Combining this with the initial assumption on x triangle inequality gives 

mm')-Eif\B),x)\^^^. 

Now we use Cauchy-Schwarz and the fact that E(/|i3) is an orthogonal projection 
of E(/|,B') (as B' is a refinement of B): 

\mf\B')\\i - immwi = \mf\B')-Eif\B)\\i ^ i(e(/is') -e(/is),x)P ^ 

In other words, it means that adding to T the pre-character corresponding to x 
increases the value of ||E(/|i3)||2 by at least Since this quantity can only take 
values between 0 and I, this procedure must terminate in at most ^ steps. In the 
end we get a set T of size at most ^ satisfying the inequality ([T]) for each character 

X- 

It remains to check that the bound on n is correct. For that we can freely assume 
i5 ^ I, which implies ^ ^ 47r > ^. However, for any such number x we have the 
bound [a:] < ^x < ^x. □ 

The corollary above says that we can get rid of any large Fourier coefficients using 
only projections of bounded complexity. The heart of the arithmetic regularity 
lemma is to iterate this argument. Before we do that, le us explain what a growth 
function is. A growth function is simply an increasing function P : (0, -l-oo) —> 
(0, -boo), typically describing how large we need one parameter to be in terms of 
another parameter. In most applications one can think of T as of an exponential 
function x >->■ . 

Proposition A.3 (arithmetic regularity lemma: baby version). Let e > 0 and let 
P be a growth function. Then there exixts a real number Mq = Mq^SjP) > 0 for 
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which the following statement is true. Let p be a prime and f : hjp’L —>■ [0,1] he a 
function. Then there exists a number 0 < M ^ Mq and a decomposition 

f — fstr fsml funi 

satisfying the following properties: 

• /str = ]E(/|S), where B = B{T,n) for some set F of pre-characters and 
some positive integer n with |r|,n ^ M (fstr is structured), 

• ||/smi||2 ^ e (fsml is Small), 

• K/unf,x)| < for every character x (funl is U'^-uniform). 

• /str and fstr + fsml both take values in [0,1]. 

Proof. We will again use an iterative procedure. At the beginning, let B be the 
trivial partition, corresponding to F = 0 and n = 1. At each stage, we set M = 
niax(|F|, n) and then apply Corollarv lA.2l with parameter <5 = y^xTj function 

/ — E(/|i3). This way we get a set F' and an integer n', both bounded in terms of 
M and J-. In fact, we need a slightly modified version of this result; to ensure that 
F C F' and n|n', we take at the beginning n ■ instead of and F istead of 
the empty set. This does not affect the boundedness of the final parameters, we 
still have the bounds of the shape |F'|,n ^ P'{M) for some growth function T' 
depending only on T. After applying this procedure we wish to set fstr = ^{f\B), 
fsml = E(/| mathcalB)' - E(/|S) and Ui = / - E(/|S)', where B' = B{T',n'). 
All the required conditions are clearly satisfied except one: it might happen that 
ll/smilb > £■ To take care of it, we use iteration: if this actually happened, set new 
F := F' and n := n'. Again, we can argue that since B' is a refinement of B, we 
have 

iiE(/|s')ii2 - wmmi = \mf\B') - Eimm = 

and therefore each iteration increases the value of ||E(/|,B )||2 by at least so we 
cannot have more than iterations in total. It means that in the end the result 
is true with Mq = T'{. .. (J^'(l)) .. .) . □ 

V ^ > 

L^J iterations 


The above version of the arithmetic regularity lemma is not quite satisfactory, as 
we expect to have a slightly different kind of structure for fstr- Before we explain, 
how to fix that, let us expliot some properties of Fejer kernel. 


Lemma A.4. Let d and K be positive integers, let Xj : —>■ C (j = 1,... ,d) be 

the basic characters defined as Xj(i) = : T —>■ [0, +oo) be the Fejer 

kernel of order K: 






K-l 


T. Ml 


fe=0 


Then f.^.^ ^K(t)dt = 1 and moreover 



^K{t)dt > 1 


d 

4A:A2' 


Proof. The first assertion is standard; to prove it one only needs to expand as 
the linear combination of characters and observe that the trivial character comes 
with coefficient 1. To prove the inequality, let us first note that 


K-l 

E 4(‘) 




1 - xfit) 

1 -XjW 




2 

|1 _ e2«tj| 


1 
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Therefore if d = 1, we have 


/[-A, A] 


^K{t)dt = 1 



^K{t)dt ^ 1 


sup ^K{t) = 1 - 
Ilill^A 


1 

4XA2 


. Now for (i > 1, the d-dimensional Fejer kernel is just a product of d copies of a 
l-dimensional one, which gives the bound 


[-A.AJ-i 


^K{t)dt > 1 


1 y ^ 1 d 

AKX^) ^ ~ AKX^ ■ 


□ 


Before stating the next result let us set a default norm on to be the maximum 
norm, i.e. ||t|| = maxi^^^d ||tj||. Consequently, we call a function F : —>• C M- 
Lipschitz, if |F(ti) — F(t 2 )| ^ M\\ti — t 2 \\ holds for all ti,t 2 G T. 

Proposition A.5 (arithmetic regularity lemma: intermediate version). Let £ > 0 
and let F he a growth function. Then there exixts a real number Mg = Mo{e,F) > 
0 for which the following statement is true. Let p > po{e,F) be a prime and 
f : TjIpIj —>■ [0,1] be a function. Then there exists a number 0 < M ^ Mg and a 
decomposition 

f — fstr T fsm\ “f funf 
satisfying the following properties: 

• /str = F o (p, where p : "Lfplj T‘^ is a homomorphism with d ^ M and 

F : —>■ [0,1] is an M-Lipschitz function (fstr is structured), 

• ll/smilb ^ e (fsm\ is small), 

• l(/unf,x)| < for every character x (funi is U'^-uniform). 

• /str and fstr + fsmi both take values in [0,1]. 

Proof. First we apply to / the baby version with some different parameters s' and 
F' to be specified later. We get a decomposition / = /'^j. + f).^^ + now we set 
fnn{ = /'nf and try to find fstr of the new type so that |l/str - /strII 2 is small and 
in the end set fsmi = f)^i + fitr ~ fstr- We know the structure of f)^p, it can be 
alternatively said that = F' o p, where p = ((/?i,..., pd) : Z/pZ —>• T'* is just 
a product of all pre-characters forming F and F' : T'^ —>■ [0,1] is a function that is 
constant on the cubes of the form [^, x • ■ • x [^, for ki,... ,kd G Z. 
The function F' does not need to be unique, we can pick any that fits into the 
formula. Also, it does not need to be Lipschitz and we have to fix that. To do this, 
put F = F' * for some K to be specified. To see that F is Lipschitz, let us 
calculate 


|F(ti)-F(t2)| 


F'(s)($x (tl — s) — ^K{t2 — s)) 


< 


< sup \^K{tl - s) - ^K{t 2 - s)|. 


The above calculation shows that the Lipschitz constant of F is bounded by that of 
^K- To estimate it, let us note that is a linear combination of characters of the 
form rij=i with \kj\ < K, and a character of this particular form comes with 

coefficient n,tr(i- and is itself a |^j |)-Lipschitz function. Therefore 

the Lipschitz constant of is at most 
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To calculate this, let us set = J2ki kd 0^=1 ~ ■ Then one can check 

that those sequenced satisfy the recurrence Ldi+d 2 = LdiL'^^+Ld^L'j^-^ and = 

which together with the boundary conditions Li = ^{K'^ — 1), = K 

gives Ld = ^dK'^-^{K^ - 1) ^ AdK‘‘+^. 

Set /str = F o (p. We would like to bound the expression 

II/',, - /.t,||^ = E,\F\p{x)) - F' * ^Mx))\\ 

Inside the expectation, some of the elements s will lie near the edges of the cubes 
and for them it would be hard to estimate the value \F'{ip{x)) — F' * (</5(a;))| 

other than trivially by 1. Let us estimate the number of such “bad” elements: the 
set of all tj G T with \\tj — ^\\ < A has measure 2A; since p is sufficiently large 
we can assume that the set of all x with \\Lpj{x) — ^\\ ^ A has size at most 4Ap. 
Taking into account all possible values of j and kj we see that all but at most AXdnp 
elements are separated from the boundary of their cubes by at least A. For those 
elements x let us estimate 

\F'{ip{x)) - F'* ^K{pix))\ [ ^K(t)\F'{ip{x)) - F'{ip{x) -t)\dt. 

By the description of x the latter factor is zero on the cube [—A, A]"^; on the re¬ 
maining set it is bounded by 1, so by the previous lemma the value of the integral 
is bounded by In the end, taking into account all values of x, we have an 

estimate 

||/'„ - UWl = E,\F'ipix)) - F' * < 4Adn + ■ 

We wish the last quantity to be at most |; to achieve this set A = and 

Now we return to the beginning, where we had to specify s' and F'. We can 
take e' = f, then ||/smi ||2 ^ ll/smilb + ll/str “ Urh <§ + § = £• To choose F', let 
us first note that 

d 2®d3„2 28 m5 

< _ — _ < _ 

The Lipschitz constant of F is then bounded by 

< 4M f —^ J =:a{M,e). 

It is now enough to take F'{M) = F{a{M,e)) and Mq = Mo{e',F') given by the 
previous version of the lemma. □ 

Now the structure of /str appears to be more natural, although we are still missing 
some information. We would like to know that the image of the homomorphism tp 
is well equidisributed in so that we could expect that /str has roughly the same 
global structure as F. To achieve this, let us set a notion of it'-independence. The 
homomorphism p = (pi ,..., pd) will be called iL-independent if the only solution 
to the equation kipi kdPd = 0 with \kj\ < K is ki = ... = kd = 0. We will 

show how we can require independence, in particular what to do if p turns out not 
to be independent. 

Lemma A.6. Let (ai,...ad) be a vector with integer coordinates. There exists 
a matrix A = [oij] G J\4d(Z) with aij = aj (for j = l,...,d) and satisfying the 
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following properties: 


d 

ajttij = 0 for i = 2,... ,d, 
i=i 


det A = 


E d 2 

.7 = 1 S 


gcd(ai, ...,ad)' 


Moreover, if ma,xjaj\ ^ K, then we can choose the entries of the matrix A to be 
bounded by K. 


Proof. Let us prove the claim by induction on d. For d = 1 the statement is trivial. 
Suppose d> 1 and we have already proved it for d — 1. We would like to extend the 
matrix found for the vector (oi,..., Ud-i) to make it work for (ai,..., ad). Setting 
did = 0 for z = 2,..., d — 1 makes the first property satisfied for those values of i. 
Also, it makes the determinant quite easy to calculate by expanding it with respect 
to the last column. Since 

3=1 S' ^ ^ 3=1 S' . gcd(ai,...,ad-i) Qd 

gcd(ai,... ,ad) gcd(ai,... ,ad_i) gcd(ai,...,Od) gcd(ai,...,Od) ’ 


it looks reasonable to set add = ^gcd^a’i "’°ad) ^ ^ ^ complete the last row 

so that the submatrix B obtained by deleting the first row and the last column has 
determinant G Z. Note that the formula 

gcd{ai,...,ad) 


det B = 


(-l)^Ei^i QjQdi 

gcd(ai,...,ad-i) 


is true if we set adj = aij for some 1 ^ z ^ d — 1 and all j = 1,... d — 1. Since 
the vectors (aij)j~^ span all of (as the determinant of the matrix they form 

is non-zero by the inductive hypothesis), the formula above is in fact true for any 
choice of ad,i, ■. ■, ad,d-i- This is good for us — if we insist that 


d 

3 = 1 


then automatically we have 


d-l 

X + 

3 = 1 


ad gcd(ai,...,ad-i) 
gcd(ai,... ,ad) 


det A = add- 


E d — 1 9 

d=l s 


gcd(ai,. 


■ ,ad-i) 

E d — 1 2 

3 = 1 S 


-b (-1) ad ■ detB = 


E d — 1 
7 = 1 


E d 2 
.7 = 1 «.7 


gcd(ai,... ,ad) gcd(ai,... ,ad-i) 

■yd — 1 


gcd(ai, ...,ad)' 


So the only condition remaining is J2j=i ^^jadj = gcd(aut..ai) ' gcd(ai, • ■ • ,ad-i)- 
This can be satisifed by the Euclidean algorithm since G Z, and thus 

we have proved the existence of the matrix A. 

Now let us consider the bounds for the entries. Obviously if \aj\ ^ K, then 


\add\ = 


gcd(ai,.. 


gcd(ai, 

....ad) 


^ K. Choosing the vector (ad,i,.. ■, ad,d-i) carefully might 


be a little bit more complicated. But if we write the equation in the form 


gcd(ai,...,ad-i) 


ad 

gcd(ai,...,ad)’ 


we can see that the claim boils down to the lemma below. 


□ 
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Lemma A. 7. Let m, K > 0 be integers and let bi,... ,hm be coprime integers not 
exceeding K in absolute value. Let b be an integer with | 6 | ^ K. Then there exist 
integers ci,... ,Cm not exceeding K in absolutee value and satisfying 

bici + ... + bmCm = b. 

Proof. If m = 1, then 6 i = ±1 and thee statement is trivial. For m = 2, if either 
of 61,62 is equal to ±1, the statement is trivial as well. If it is not the case, then 
without loss of generality we can assume 61 > 62 > 0. But then the numbers 6 — kb 2 
for |fc| sC K are at most Kbi in absolute value and at least one of them is a multiple 
of 61 , so the claim follows. Suppose now that m > 2 and we have already proven 
the claim for all smaller values of m. Let g = gcd( 6 i,..., 6 ^- 1 )- Then bm and g are 
coprime and have absolute value at most K, so we can use the claim for m = 2 to 
get gb' + bmCm = b with | 6 '|, \cm\ ^ K- Also the numbers ■ • ■, are coprime 
integers bounded by K in absolute value, which gives us yCi +... + Cm-i = 6 ' 
with |ci|,..., |cm-i| ^ K. Combining those two identities we get the claim. □ 

Lemma lA.61 gives us the following corollary. 

Corollary A. 8 . Let d > 1 be an integer, let (f : Z/pZ —>■ be a homomorphism 
and let F : —>■ C be an M-Lipschitz function. Then at least one of the following 
holds: 

• p is A-independent, 

• there exits a homomorphism p' : Z/pZ —>■ and a dAM-Lipschitz 

function F' : —)> C with F' o ip' = F o ip. 

Proof. Suppose p is not A-independent, i.e. there exist intgers ai,...,ad with 
0 lojl < A. It is not hard to see that the second part is true 
for p ^ A as long as d — 1 ^ 1; suppose then p > A. In that case we are allowed 
to divide all of aj by their greatest common divisor and without loss of generality 
assume gcd(ai,..., Ud) = 1. By Lemma IA. 6 I we can find d — 1 integer vectors 
orthogonal to a = (oi,..., Od) suth that the matrix A consisting of all of them has 
determinant We claim that the Z-span of these d — 1 vectors coincides 

with the intersection of their K-span and Z'^. Indeed, we know that the Z-span of 
all d vectors is a subgroup of Z^^ of index det A = On the other hand, 

the map x >—>■ (a, x) (mod det A) gives rise to a surjective homomorphism from the 
quotient group to a group of size det A. Therefore this homomorphism is in fact 
an isomorphism and its kernel is precisely the Z-span of all d vectors. Intersecting 
it with the R-span of the last d — 1 vecrors obviously gives us their Z-span. But 
this intersection can be easily seen as {x £ lA : (a, x) = 0 } or in other words the 
intersection of the R-span and all of lA. 

Since gcd(ai,... ,ad) = 1, it follows that the set {t S T'^ : (a,t) = 0 S T} is in 
fact the image of the subspace {a: G R^^ : {a,x) = 0 G R} under the projection 
(mod lA). Therefore it can be parametrized as A'(T‘^“^), where A' is (d — 1) x d 
matrix obtained from A by deleting its first row (oi,..., a„). Let F' : C 

and p' : Z/pZ —>■ be functions satisfying F' = F o A' and p = A' o p'. Note 

that p' is well defined and is a homomorphism. Then 

F' o p' = F o A' o p' = F o p. 

The only thing left is to estimate the Lipschitz constant of F'. Since A' has entries 
bounded by A, it can be viewed as a dA-Lipshitz function. Composing it with an 
M-Lipshitz function F gives us a function of Lipschitz constant at most dKM. □ 

Now we are ready to give a proof of the full version of the regularity lemma (in 
the {7^ case). 
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Theorem A.9 (arithmetic regularity lemma: final version). Let £ > 0 and let T be 
a growth function. Then there exixts a real number Mq = > 0 for which 

the following statement is true. Let p > po{e, IF) be a prime and f : Z/pZ —>■ [0,1] 
he a function. Then there exists a number 0 < M ^ Mq and a decomposition 

f — fstr “t fsml “t funi 

satisfying the following properties: 

• /str = F o T, where ip : Z/pZ T‘^ is a F{M)-independent homomor¬ 
phism with d ^ M and : T'^ —>■ [0,1] is an M-Lipschitz function (fsti is 
structured), 

• ||/smi||2 ^ e (fsml IS Small), 

• l(/unf,x)| < for every character x (funi is U'^-uniform). 

• /str and /str + fsml both take values in [0,1]. 

Proof. Let us start with applying the previous version of the regularity lemma with 
the same parameter e and a different growth function F' to be specified. We stick 
to the obtained decomposition / = /str + fsml + /unf but we would like to exploit 
more properties of F and ip. If p is independent, we are done. Otherwise we use 
Lemma Fa. 81 to decrease the dimension d by 1 at the cost of potentially increasing 
the Lipschitz constant up to dMF{M) ^ M'^F{M). Put Fi{M) = M^F{M); since 
this procedure can be applied at most d ^ M times, so the correct choice of F' is 
F'iM)=F{Fii...iFiiM))...)). □ 

[MJ times 


To make a proper use of the above result, we often need to relate the behaviours 
of /str and F. Below we prove a statement of this kind. 


Lemma A.10. Let d be a positive integer, p be a prime, let p\,... ,pd '■ 'Ljp'L —>■ T 
he pre-characters and let p = {pi, ..., pd) : Z/pZ —>■ be their product. Suppose 
that the set {pj}‘j=i is K-independent. Let F : ^ C be an M-Lipschitz function. 

Then 


V.,^F{p{x)) 




Proof. Let Xj : —>• C defined via x(ti ... ,td) = be the basic characters 

and let : T —>■ C be the Fejer kernel defined via the formula 


^K{t) 


1 


d 


n 




Recall that ^K{t)dt = 1. We also have the bound 


K-l 


E 4 (^) 


1 - xf if) 

1 -XiW 




1 - 


1 


Therefore ^K{t) ^ 0^=1 4 Kl\tjP ■ Having this inequality we would like to show 
that F{t) and F * ^K{t) are close together for any t G T‘^. Let us estimate their 
difference: 


\F{t)-F*<^>K{t)\ 


[ {F(t) - F{t- s))^K{s)ds 
Jfi 


if M ||s||$_R-(s)(is. 


This is already independent of t; now we make use of the fact that $r:(s) is large 
precisely when ||s|| is small; more accurately ||s|| ^ u implies $r-(s) < for 
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any u ^ 0. Combining this with = 1 leads to the inequality 

r r /•1/2 


/ \mK{s)ds = 

Ijd Jjd Jq 

fl/2 n 


^{u^\\s\\}^Kis)duds = 

. 1/2 


min 1 


Ifd 


1 


’ AKu"^ 


du ^ 




and consequently \F{t) —F*^K{t)\ < for any t G In particular, the triangle 
inequality yields |E2;i^((/9(a;)) — ^ Let us now expand the last 

expression: 


Ea;J^ * $R(/j(a;)) = Ea; F{t)^K{'^{x) - t)d?j = 

= [ F{t) (E2,$k(V3(x) - t)) .dt 

Jjd 

Expanding out the formula for we can see that the average — t) is 

a sum of the averages of the form c-¥.xX{‘P(x) — t), where x(0 — Il/^i Xj^ some 
ttj G {\ — K,..., —1, 0,1,..., iG — 1}. But from the iG-independence of {p we 
can see that x{v{x) — t) is never a constant function in x and therefore has average 
0, unless x is a trivial chatracter with ai = ... = = 0, in which case c = 1. In the 

end, Ex^Ki‘p{x) — t) = I for all t G T"^, which leads to ExF*^k{‘p{x)) = F{t)dt. 
Plugging this formula into the previous inequality, we get the desired result. □ 

One may wonder how we managed to prove the above lemma without assuming 
that p is large. In fact, the ib-independence of a set of d homomorphisms carries a 
hidden assumption p ^ K'^. 

In this paper we did not need the above result, or even the full version of the 
regularity lemma (the intermediate version would be enough), but in general it 
might be useful to have them around. 
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